Dimension Table Selection Strategies to Referential Partition a Fact Table of Relational Data Warehouses
نویسنده
چکیده
Enterprise wide data warehouses are becoming increasingly adopted as the main source and underlying infrastructure for business intelligence (BI) solutions. Note that a data warehouse can be viewed as an integration system, where data sources are duplicated in the same repository. Data warehouses are designed to handle the queries required to discover trends and critical factors are called Online Analytical Processing (OLAP) systems. Examples of an OLAP query are: Amazon (www. amazon.com) company analyzes purchases by its customers to come up with an individual screen with products of likely interest to the customer. Analysts at WalMart (www.walmart.com) look for items with increasing sales in some city. Star schemes or their variants are usually used to model warehouse applications. They are composed of thousand of dimension tables and multiple fact tables [15, 18]. Figure 2.1 shows an example of star schema of the widely-known data warehouse benchmark APB-1 release II [21]. Here, the fact table Sales is joint to the following four dimension tables: Product, Customer, Time, Channel. Star queries are typically executed against the warehouse. Queries running on such applications contain a large number of costly joins, selections and aggregations. They are called mega queries [24]. To optimize these queries, the use of advanced optimization techniques is necessary. By analyzing the most important optimization techniques studied in the literature and supported by the most important commercial Database Management Systems (DBMS) like Oracle, SQL Server, DB2, etc. we propose to classify them into two main categories: (1) optimization techniques selected during the creation of data warehouses and (2) optimization techniques selected after the creation of the warehouses. The decision of choosing techniques in the first category is
منابع مشابه
Horizontal Partitioning Selection Problems : Concepts , Algorithms and Advisor Tool
Horizontal partitioning has evolved significantly in recent years and widely advocated by the academic and industrial communities. Horizontal Partitioning affects positively query performance, database manageability and availability. Two types of horizontal partitioning are supported: primary and referential. Horizontal fragmentation in the context of relational data warehouses is to partition ...
متن کاملPrimary and Referential Horizontal Partitioning Selection Problems
Horizontal partitioning has evolved significantly in recent years and widely advocated by the academic and industrial communities. Horizontal Partitioning affects positively query performance, database manageability and availability. Two types of horizontal partitioning are supported: primary and referential. Horizontal fragmentation in the context of relational data warehouses is to partition ...
متن کاملPrimary and Referential Horizontal Partitioning Selection Problems: Concepts, Algorithms and Advisor Tool
Horizontal partitioning has evolved significantly in recent years and widely advocated by the academic and industrial communities. Horizontal Partitioning affects positively query performance, database manageability and availability. Two types of horizontal partitioning are supported: primary and referential. Horizontal fragmentation in the context of relational data warehouses is to partition ...
متن کاملWhy is the Star Schema a Good Data
Database design for data warehouses is based on the notion of the snowwake schema and its important special case, the star schema. The snowwake schema represents a dimensional model which is composed of a central fact table and a set of constituent dimension tables which can be further broken up into subdimension tables. We formalise the concept of a snowwake schema in terms of an acyclic datab...
متن کاملDimensions Based Data Clustering and Zone Maps
In recent years, the data warehouse industry has witnessed decreased use of indexing but increased use of compression and clustering of data facilitating efficient data access and data pruning in the query processing area. A classic example of data pruning is the partition pruning, which is used when table data is range or list partitioned. But lately, techniques have been developed to prune da...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017